AIbase

# Low-Memory Inference

InternVL3 38B FP8 Dynamic
MIT
This is the FP8 dynamic quantization version of OpenGVLab/InternVL3-38B, optimized for high-performance inference with vLLM. It achieves approximately 2x acceleration on vision-language tasks with minimal accuracy loss.
Image-Text-to-Text, Safetensors, Supports Multiple Languages
ConfidentialMind
5,173
1
NLLB 200 Distilled 1.3B CT2 Int8
This is the CTranslate2 int8 conversion of NLLB-200 Distilled 1.3B, a neural machine translation model developed by Meta that supports translation between 200 languages. CTranslate2 is used for efficient inference.
Machine Translation, Transformers, Supports Multiple Languages
OpenNMT
101
10
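
Both listings above fall under low-memory inference because they store weights in 8 bits (FP8 and int8). As a rough illustration, the sketch below estimates weight-only memory from the parameter counts in the model names. The 16-bit baseline is an assumption about the unquantized checkpoints, and the helper names are hypothetical. Activations, KV cache, and runtime overhead are not counted, so real usage will be higher.

```python
# Back-of-the-envelope estimate of weight memory at different precisions.
# Parameter counts come from the model names; the 16-bit baseline is an
# assumption, not a figure taken from either model card.

def weight_memory_gib(num_params: float, bytes_per_param: float) -> float:
    """Approximate memory for model weights alone, in GiB."""
    return num_params * bytes_per_param / 2**30

MODELS = {
    "InternVL3-38B": 38e9,
    "NLLB-200 Distilled 1.3B": 1.3e9,
}

PRECISIONS = {
    "16-bit (assumed baseline)": 2,  # FP16 / BF16: 2 bytes per parameter
    "8-bit (FP8 / int8)": 1,         # 1 byte per parameter
}

def footprint_table() -> dict:
    """Return {model: {precision: GiB}} rounded to one decimal place."""
    return {
        name: {
            label: round(weight_memory_gib(params, nbytes), 1)
            for label, nbytes in PRECISIONS.items()
        }
        for name, params in MODELS.items()
    }

if __name__ == "__main__":
    for name, row in footprint_table().items():
        for label, gib in row.items():
            print(f"{name:26s} {label:28s} ~{gib} GiB")
```

Under these assumptions, 8-bit storage halves the weight footprint relative to a 16-bit baseline, which is the main reason quantized checkpoints fit on smaller hardware.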
© 2025 AIbase